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INTRODUCTION 

2 Technical Field 

The technical field of this invention concerns peptides, polypeptides, and 
polynucleotides involved in nerve cell growth. 

25 

Background 

The specificity of the wiring of the nervous system - the complex pattern of specific 
synaptic connections — begins to unfold during development as the growing tips of neurons - 
the growth cones - traverse long distances to find their correct targets. Along their journey, 
30 they are confronted by and correctly navigate a series of choice points in a remarkably 
unerring way to ultimately contact and recognize their correct target. 

The identification of growth cone guidance cues is to a large extent, the holy grail of 
neurobiology. These are the compounds that tell neurons when to grow, where to grow, and 
when to stop growing. The medical applications of such compounds and their antagonists are 
35 enormous and include modulating neuronal growth regenerative capacity, treating 

neurodegenerative disease, and mapping (e.g. diagnosing) genetic neurological defects. 

Over decades of concentrated research, various hypotheses of chemo-attractants and 
repellant, labeled pathways, cell adhesion molecules, etc. have been evoked to explain 
guidance. Recently, several recent lines of experiments suggest repulsion may play an 



important role in neuron guidance and two apparently unrelated factors ("Neurite Growth 
Inhibitor" and "Collapsin") capable of inhibiting or collapsing growth cones have been 
reported. 

5 Relevant Literature 

For a recent review of much of the literature in this field, see Goodman and Shatz 
(1993) Cell 72/Neuron 10, 77-98. A description of grasshopper fasciclin IV (now called G- 
Semaphorin I) appears in Kolodkin et al. (1992) Neuron 9, 831-845. Recent reports on 
Collapsin and Neurite Growth Inhibitor include Raper and Kapfhammer ( 1 990) Neuron 4,21- 
10 29, an abstract presented by Raper at the GIBCO-BRL Symposium on "Genes and 

Development/Function of Brain" on July 26, 1993 and Schwab and Caroni (1988) J Neurosci 
8, 2381 and Schnell and Schwab (1990) Nature 343, 269, respectively. 

SUMMARY OF THE INVENTION 
liS A novel class of proteins, semaphorins, nucleic acids encoding semaphorins, and 

2 methods of using semaphorins and semaphorin-encoding nucleic acids are disclosed. 
SI Semaphorins include the first known family of human proteins which function as growth 
^ cone inhibitors and a family of proteins involved in viral, particularly pox viral, pathogenesis 
m and oncogenesis. Families of semaphorin-specific receptors, including receptors found on 
20 nerve growth cones and immune cells are also disclosed. 

Jr{ The invention provides agents, including semaphorin peptides, which specifically 

□ bind semaphorin receptors and agents, including semaphorin receptor peptides, which 
% specifically bind semaphorins. These agents provide potent modulators of nerve cell growth, 
m immune responsiveness and viral pathogenesis and find use in the treatment and diagnosis of 
25 neurological disease and neuro-regeneration, immune modulation including hypersensitivity 

and graft-rejection, and diagnosis and treatment of viral and oncological infection/diseases. 
Semaphorins, semaphorin receptors, semaphorin-encoding nucleic acids, and unique 

portions thereof also find use variously in screening chemical libraries for regulators of 

semaphorin or semaphorin receptor-mediated cell activity, in genetic mapping, as probes for 
30 related genes, as diagnostic reagents for genetic neurological, immunological and oncological 

disease and in the production of specific cellular and animal systems for the development of 

neurological, immunological, oncological and viral disease therapy. 

DESCRIPTION OF SPECIFIC EMBODIMENTS 
35 The present invention discloses novel families of proteins important in nerve and 

immune cell function: the semaphorins and the semaphorin receptors. The invention provides 
agents, including semaphorin peptides, which specifically bind semaphorin receptors and 
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agents, including semaphorin receptor peptides, which specifically bind semaphorins. These 
agents find a wide variety of clinical, therapeutic and research uses, especially agents which 
modulate nerve and/or immune cell function by specifically mimicing or interfering with 
semaphorin-receptor binding. For example, selected semaphorin peptides shown to act as 
5 semaphorin receptor antagonists are effective by competitively inhibiting native semaphorin 
association with cellular receptors. Thus, depending on the targeted receptor, these agents 
can be used to block semaphorin mediated neural cell growth cone repulsion or contact 
inhibition. Such agents find broad clinical application where nerve cell growth is indicated, 
e.g. traumatic injury to nerve cells, neurodegenerative disease, etc. A wide variety of 
10 semaphorin- and semaphorin receptor-specific binding agents and methods for identifying, 
making and using the same are described below. 

Binding agents of particular interest are semaphorin peptides which specifically bind 
and antagonize a semaphorin receptor and semaphorin receptor peptides which specifically 

□ bind a semaphorin and prevent binding to a native receptor. While exemplified primarily 
15 with semaphorin peptides, much of the following description applies analogously to 

\j semaphorin receptor peptides. 

EP The semaphorin peptides of the invention comprise a unique portion of a semaphorin 

jjj and have semaphorin binding specificity. A "unique portion" of a semaphorin has an amino 
s acid sequence unique to that disclosed in that it is not found in any previously known protein. 
2f|! Thus a unique portion has an amino acid sequence length at least long enough to define a 

□ novel peptide. Unique semaphorin portions are found to vary from about 5 to about 25 

f= residues, preferably from 5 to 10 residues in length, depending on the particular amino acid 
sequence. Unique semaphorin portions are readily identified by comparing the subject 
semaphorin portion sequences with known peptide/protein sequence data bases. Preferred 

25 unique portions derive from the semaphorin domains (which exclude the Ig-like, intracellular 
and transmembrane domains as well as the signal sequences) of the disclosed semaphorin 
sequences, especially regions that bind the semaphorin receptor, especially that of the human 
varieties. Preferred semaphorin receptor unique portions derive from the semaphorin binding 
domains, especially regions with residues which contact the semaphorin ligand, especially 

30 that of the human varieties. Particular preferred peptides are further described herein. 

The subject peptides may be free or coupled to other atoms or molecules. Frequently 
the peptides are present as a portion of a larger polypeptide comprising the subject peptide 
where the remainder of the polypeptide need not be semaphorin- or semaphorin receptor- 
derived. Alternatively, the subject peptide may be present as a portion of a "substantially 

35 full-length" semaphorin domain or semaphorin receptor sequence which comprises or 

encodes at least about 200, preferably at least about 250, more preferably at least about 300 
amino acids of a disclosed semaphorin/receptor sequence. Thus the invention also provides 
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polypeptides comprising a sequence substantially similar to that of a substantially full-length 
semaphorin domain or a semap horin receptor. "Substantially similar" sequences share at least 
about 40%, more preferably at least about 60%, and most preferably at least about 80% 
sequence identity. Where the sequences diverge, the differences are generally point 
5 insertions/deletions or conservative substitutions, i.e. a cysteine/threonine or serine 

substitution, an acidic/acidic or hydrophobic/hydrophobic amino acid substitution, etc. 

The subject semaphorin peptides/polypeptides are "isolated", meaning 
unaccompanied by at least some of the material with which they are associated in their natural 
state. Generally, an isolated peptide/polypeptide constitutes at least about 1 %, preferably at 
10 least about 10%, and more preferably at least about 50% by weight of the total peptide/protein 
in a given sample. By pure peptide/polypeptide is intended at least about 90%, preferably at 
least 95%, and more preferably at least about 99% by weight of total peptide/protein. 
Included in the subject peptide/polypeptide weight are any atoms, molecules, groups, or 
u polymers covalently coupled to the subject semaphorin/receptor peptide/polypeptide, 
15j especially peptides, proteins, detectable labels, glycosylations, phosphorylations, etc. 

The subject peptides/polypeptides may be isolated or purified in a variety of ways 
sj known to those skilled in the art depending on what other components are present in the 
y I sample and to what, if anything, the peptide/polypeptide is covalently linked. Purification 

methods include electrophoretic, molecular, immunological and chromatographic techniques, 
20 especially affinity chromatography and RP-HPLC in the case peptides. For general guidance 
^ in suitable purification techniques, see Scopes, R., Protein Purification, Springer- Verlag, NY 
3 (1982). 

j The subject peptides/polypeptides generally comprise naturally occurring amino acids 

m but D-amino acids or amino acid mimetics coupled by peptide bonds or peptide bond 

25 mimetics may also be used. Amino acid mimetics are other than naturally occurring amino 
acids that conformationally mimic the amino acid for the purpose of the requisite 
semaphorin/receptor binding specificity. Suitable mimetics are known to those of ordinary 
skill in the art and include p-y-6 amino and imino acids, cyclohexylalanine, adamantylacetic 
acid, etc., modifications of the amide nitrogen, the a-carbon, amide carbonyl, backbone 

30 modifications, etc. See, generally, Morgan and Gainor (1989) Ann. Repts. Med. Chem 24, 

243-252; Spatola (1983) Chemistry and Biochemistry of Amino Acids, Peptides and Proteins, 
Vol VH (Weinstein) and Cho et. al (1993) Science 261, 1303-1305 for the synthesis and 
screening of oligocarbamates. 

The subject semaphorin peptides/polypeptides have a "semaphorin binding 

35 specificity" meaning that the subject peptide/polypeptide retains a molecular conformation 
specific to one or more of the disclosed semaphorins and specifically recognizable by a 
semaphorin-specific receptor, antibody, etc. As such, a semaphorin binding specificity may 
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be provided by a semaphorin-specific immunological epitope, lectin binding site, etc., and 
preferably, a receptor binding site. Analogously, the semaphorin receptor 
peptides/polypeptides have a "semaphorin receptor binding specificity" meaning that these 
peptides/polypeptides retain a molecular conformation specific to one or more of the 
5 disclosed semaphorin receptors and specifically recognizable by a semaphorin, a receptor- 
specific antibody, etc. 

"Specific binding" is empirically determined by contacting, for example a 
semaphorin-derived peptide with a mixture of components and identifying those components 
that preferentially bind the semaphorin. Specific binding is most conveniently shown by 
10 competition with labeled ligand using recombinant semaphorin peptide either in vitro or in 
cellular expression systems as disclosed herein. Generally, specific binding of the subject 
semaphorin has binding affinity of 10" 6 M, preferably 10" 8 M, more preferably 10 10 M, under in 
vitro conditions as exemplified below. 

The peptides/polypeptides may be modified or joined to other compounds using 
131 physical, chemical, and molecular techniques disclosed or cited herein or otherwise known to 
2 those skilled in the relevant art to affect their semaphorin binding specificity or other 
'%* properties such as solubility, membrane transportability, stability, binding specificity and 
0? affinity, chemical reactivity, toxicity, bioavailability, localization, detectability, in vivo half- 
Si life, etc. as assayed by methods disclosed herein or otherwise known to those of ordinary skill 
20 in the art. For example, point mutations are introduced by site directed mutagenesis of 
M nucleotides in the DNA encoding the disclosed semaphorin polypeptides or in the course of in 
O vitro peptide synthesis. 

% Other modifications to further modulate binding specificity/affinity include 

m chemical/enzymatic intervention (e.g. fatty acid-acylation, proteolysis, glycosylation) and 

25 especially where the peptide/polypeptide is integrated into a larger polypeptide, selection of a 
particular expression host, etc. In particular, many of the disclosed semaphorin peptides 
contain serine and threonine residues which are phosphorylated or dephosphorylated. See e.g. 
methods disclosed in Roberts et al. (1991) Science 253, 1022-1026 and in Wegner et al. 
(1992) Science 256, 370-373. Amino and/or carboxyl termini may be functionalized e.g., for 

30 the amino group, acylation or alkylation, and for the carboxyl group, esterification or 

amidification, or the like. Many of the disclosed semaphorin peptides/polypeptides also 
contain glycosylation sites and patterns which may disrupted or modified, e.g. by enzymes 
like glycosidases or used to purify/identify the receptor, e.g. with lectins. For instance, N or 
O-linked glycosylation sites of the disclosed semaphorin peptides may be deleted or 

35 substituted for by another basic amino acid such as Lys or His for N-linked glycosylation 
alterations, or deletions or polar substitutions are introduced at Ser and Thr residues for 
modulating O-linked glycosylation. Glycosylation variants are also produced by selecting 
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appropriate host cells, e.g. yeast, insect, or various mammalian cells, or by in vitro methods 
such as neuraminidase digestion. Useful expression systems include COS-7, 293, BHK, 
CHO, TM4, CV1, VERO-76, HELA, MDCK, BRL 3 A, W138, Hep G2, MMT 060562, TRI 
cells, baculovirus systems, for examples. Other covalent modifications of the disclosed 
semaphorin peptides/polypeptides may be introduced by reacting the targeted amino acid 
residues with an organic derivatizing (e.g. methyl-3-[(p-azido-phenyl)dithio] propioimidate) 
or crosslinking agent (e.g. l,l-bis(diazoacetyl)-2-phenylethane) capable of reacting with 
selected side chains or termini. For therapeutic and diagnostic localization, semaphorins and 
peptides thereof may be labeled directly (radioisotopes, fluorescers, etc.) or indirectly with an 
agent capable of providing a detectable signal, for example, a heart muscle kinase labeling 
site. 

The following are 14 classes of preferred semaphorin peptides where bracketed 
positions may be occupied by any one of the residues contained in the brackets and "X" 
signifies that the position may be occupied by any one of the 20 naturally encoded amino 
acids (see, Table 1). These enumerated peptides maintain highly conserved structures which 
provide important semaphorin binding specificities; 

(a) [ AspGlu]Cys[GlnLysArgAlaAsn] Asn[TyrPheVal]Ile (SEQ ID NO: 1 ) 
Cys[GlnLysArgAlaAsn]Asn[TyrPheVal]Ile[ArgLysGlnThr] (SEQ ID NO:2) 

(b) CysGlyThr[AsnGly][AlaSerAsn][TyrPheHisGly][LysArgHisAsnGln] (SEQ ID NO: 3) 

CysGIyThr[AsnGly][AlaSerAsn]XaaXaaPro (SEQ ID NO:4) 
CysGlyThr[AsnGly]XaaXaaXaaProXaa[CysAsp] (SEQ ID NO:5) 
CysGlyThrXaaXaaXaaXaaProXaa[CysAsp]XaaXaa[TyrIle] (SEQ ID NO:6) 

(c) [ArgneGlnVal][GlyAla][LeuValLys][CysSer]Pro[PheTyr][AspAsn] (SEQ ID NO:7) 

[CysSer]Pro[PheTyr][AspAsn]Pro[AspGluArgLys][HisLeuAsp] (SEQ ID NO:8) 
GlyXaa[GlyAla]Xaa[CysSer]ProTyr[AspAsn]Pro (SEQ ID NO:9) 

(d) Leu[PheTyr]Ser[GlyAla]Thr[ValAsnAla] Ala (SEQ ID NO: 10) 
Leu[PheTyr]SerXaaThrXaaAla[AspGlu][PheTyr] (SEQ ID NO:l 1) 
[PheTyr]Ser[GlyAla]Thr[ValAsnAla]Ala[AspGlu][PheTyr] (SEQ ID NO: 12) 

(e) Leu[AsnAsp][AlaLys]ProAsnPheVal (SEQ ID NO: 13) 

(f) PhePhePhe ArgGlu (SEQ ID NO: 1 4) 
PhePhe[PheTyr]ArgGlu[ThrAsn] (SEQ ID NO: 15) 
PhePheArgGlu[ThrAsn]Ala (SEQ ID NO: 16) 
Phe[PheTyr]ArgGlu[ThrAsn]Ala (SEQ ID NO: 17) 
TyrPhePhe[PheTyr] ArgGlu (SEQ ID NO: 18) 
[PheTyr]PhePhe[PheTyr] ArgGlu (SEQ ID NO: 19) 
[PheTyr][PheTyr][PheTyr]ArgGlu[ThrAsn]Ala (SEQ ID NO:20) 
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[neVal][PheTyr]Phe[PheTyr][PheTyr]ArgGlu (SEQ ID NO:21) 
Asp[LysPheTyr]Val[PheTyr][PheTyrneLeu][PheTyrneLeu][PheTyr] (SEQ ID NO:22) 
[ValHe][PheTyr][PheTyrEeLeu][PheTyrneLeu]Phe[ArgThr]Xaa[ThrAsn](SEQIDNO:23) 
[ Vallle] [PheTyr] [PheTyrlleLeu] [PheTyrlleLeu] [PheTyr] [ArgThr] [GluAspVal] [Thr Asn] 
5 (SEQ ID NO:24) 

(g) Glu[PheTyr]HeAsn[CysSer]GlyLys (SEQ ID NO:25) 
[PheTyr]IleAsnCysGlyLys[AlaVame] (SEQ ID NO:26) 

(h) Arg[Vaine][AlaGly][ArgGln][Vame]CysLys (SEQ ID NO:27) 
Arg[ValHe]Xaa[ArgGln][ValIle]CysXaaXaaAsp (SEQ ID NO:28) 

10 GlyLys[ValAlane]XaaXaaXaaArg[ValAlane]XaaXaaXaaCysLys (SEQ ID NO:29) 

(i) [ArgLysAsn]Trp[ThrAlaSer][ThrAlaSer][PheTyrLeu]Leu[LysArg] (SEQ ID NO: 30) 

[PheTyr]Leu[LysArg][AlaSer]ArgLeu[Asnne]Cys (SEQ ID NO:31) 
[AsnIle]CysSer[IleVal][ProSer]Gly (SEQ ID NO:32) 
Trp[ThrAlaSer][ThrAlaSer][PheTyrLeu]LeuLys[AlaSerVaffleLeu]XaaLeu (SEQ ID NO:33) 
l[| Trp[ThrAlaSer][ThrAlaSer]XaaLeuLysXaaXaaLeuXaaCys (SEQ ID NO:34) 

0 TrpXaa[ThrSer]XaaLeuLysXaaXaaLeuXaaCys (SEQ ID NO:35) 
Jj (j) [PheTyr] [PheTyr] [AsnAsp]GluLeGlnSer (SEQ ID NO:36) 

m [PheTyr]Pro[PheTyr] [PheTyr] [PheTyr] [AsnAsp]Glu (SEQ ID NO: 37) 

;r[ (k) GlySerAla[VaineLeu]CysXaa[PheTyr] (SEQ ID NO:38) 
20 SerAla[ValIleLeu]CysXaa[PheTyr]XaaMet (SEQ ID NO:39) 

□ (1) AsnSer[AsnAla]TrpLeu[ProAla]Val (SEQ ID NO:40) 

1 (m) [ValLeuHe]Pro[GluAspTyrSerPhe]ProArgProGly (SEQIDNO:41) 
5 [ValLeune]ProXaaPro[ArgAla]ProGlyXaaCys (SEQ ID NO:42) 

gj Pro[GluAspTyrSerPhe]ProArgProGly[ThrGlnSer]Cys (SEQ ID NO:43) 

25' (n) AspPro[HisPheTyr]Cys[AlaGly]Trp (SEQ ID NO:44) 
Pro[HisPheTyr]Cys[AlaGly]TrpAsp (SEQ ID NO:45) 
AspProXaaCys[AlaGly]TrpAsp (SEQ ID NO:46) 
CysXaaXaaXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:47) 
CysXaaXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:48) 

30 CysXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:49) 

CysXaaXaaCysXaaXaaXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO:50) 
CysXaaXaaCysXaaXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO:51) 
CysXaaXaaCysXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO: 52). 

35 The following peptides represent particularly preferred members of each class: 

(a) AspCysGlnAsnTyrlle (SEQ ID NO: 67) 
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(b) CysGlyThr[AsnGly] [AlaSerJXaaXaaPro (SEQ ID NO:68) 

(c) GlyXaa[SerCys]ProTyrAspPro (SEQ ID NO:69) 

(d) LeuTyrSerGlyThr[ValAsnAla]Ala (SEQ ID NO.70) 

(e) LeuAsnAlaProAsnPheVal (SEQ ID NO:7 1 ) 

5 (f) [PheTyr]PhePhe[PheTyr] ArgGlu (SEQ ID NO: 19) 

(g) Glu[PheTyr]IleAsn[CysSer]GlyLys (SEQ ID NO:25) 

(h) Arg[ValIle]AlaArgValCysLys (SEQ ID NO:72) 

(i) Trp[ThrAla] [ThrSer] [PheTyr]LeuLys[AlaSer] ArgLeu (SEQ ID NO:73) 
(j) ProPheTyrPhe[AsnAsp]GluEeGlnSer (SEQ ID NO:74) 

10 (k) GlySerAlaValCysXaa[PheTyr] (SEQ ID NO:75) 

(1) AsnSerAsnTrpLeu[ProAla]Val (SEQ ID NO:76) 
(m) Pro[GluAsp]ProArgProGly[ThrGlnSer]Cys (SEQ ID NO:77) 
(n) AspProTyrCys[AlaGly]TrpAsp (SEQ ID NO:78). 

1;5? The following 14 classes are preferred peptides which exclude semaphorin peptides 

eft encoded in open reading frames of Variola major or Vaccinia viruses. 

1j (a) [AspGlu]Cys[GlnLysArgAlaAsn]Asn[TyrPheVal]Ile (SEQ ID NO:01) 

fU Cys[GlnLysArgAlaAsn]Asn[TyrPheVal]He[ArgLysGlnThr] (SEQ ID NO:02) 

20, (b) CysGlyThr[AsnGly][AlaSer][TyrPheHisGly][LysArgHisAsnGln] (SEQ ID NO:79) 
ftj CysGlyThr[AsnGly][AlaSerAsn][TyrPheHis][LysArgHisAsnGln] (SEQ ID NO:80) 
| CysGlyThr[AsnGly][AlaSer]XaaXaaPro (SEQ ID NO:81) 

5 (c)[ArgHeGlnVal] [GlyAla] [LeuValLys] [CysSer]Pro[PheTyr] [AspAsn] (SEQ ID NO:07) 
fU [CysSer]Pro[PheTyr] [AspAsn] Pro [AspGluArgLys] [HisLeuAsp] (SEQ ID NO:08) 

25 GlyXaa[GlyAla]Xaa[CysSer]ProTyr[AspAsn]Pro (SEQ ID NO:09) 

(d) Leu[PheTyr]Ser[GlyAla]Thr[ValAsnAla] Ala (SEQ ID NO: 10) 
Leu[PheTyr]SerXaaThrXaaAla[AspGlu][PheTyr] (SEQ ID NO: 11) 
[PheTyr]Ser[GlyAla]Thr[ValAsnAla]Ala[AspGlu][PheTyr] (SEQ ID NO: 12) 

(e) Leu[AsnAsp] [ AlaLys]ProAsnPheVal (SEQ ID NO: 1 3) 
30 (f) PhePhePhe ArgGlu (SEQ ID NO: 1 4) 

PhePhe[PheTyr]ArgGlu[ThrAsn] (SEQ ID NO: 15) 
PhePheArgGlu[ThrAsn]Ala (SEQ ID NO: 16) 
Phe[PheTyr]ArgGlu[ThrAsn]Ala (SEQ ID NO: 17) 
TyrPhePhe[PheTyr] ArgGlu (SEQ ID NO: 18) 
35 [PheTyr]PhePhe[PheTyr] ArgGlu (SEQ ID NO: 19) 

[PheTyr][PheTyr][PheTyr]ArgGlu[ThrAsn]Ala (SEQ ID NO:20) 
[HeVal][PheTyr]Phe[PheTyr][PheTyr] ArgGlu (SEQ ID NO:21) 
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Asp[LysPheTyr]Val[PheTyr][PheTyrLeu][PheTyrneLeu][PheTyr] (SEQ ID NO:22) 
Asp[LysPheTyr]Val[PheTyr][PheTyrEeLeu][PheTyrne][PheTyr] (SEQ ID NO: 82) 
[Vame][PheTyr][PheTyrLeu][PheTyrLeLeu]Phe[ArgThr]Xaa[ThrAsn] (SEQ ID NO:83) 
[ValEe][PheTyr][PheTyrneLeu][PheTyrne]Phe[ArgThr]Xaa[ThrAsn] (SEQ ID NO:84) 
5 [Value] [PheTyr] [PheTyrHeLeu] [PheTyrneLeu]PheArgXaa[ThrAsn] (SEQ ID NO : 85) 

[Vallle] [PheTyr] [PheTyrLeu] [PheTyrHeLeu] [PheTyr] [ArgThr] [GluAspVal] [ThrAsn] (SEQ 
ID NO:86) 

(g) Glu[PheTyr]IleAsn[CysSer]GlyLys (SEQ ID NO:25) 

[PheTyr]BeAsnCysGlyLys[AlaValIle] (SEQ ID NO:26) 
1 0 (h) Arg[ValHe] [AlaGly] [ArgGln] [ValIle]CysLys (SEQ ID NO:27) 

Arg[ValHe]Xaa[ArgGln][Vame]CysXaaXaaAsp (SEQ ID NO:28) 
GlyLys[ValAlane]XaaXaaXaaArg[ValAlaIle]XaaXaaXaaCysLys (SEQ ID NO:29) 
(i) [ ArgLys Asn]Trp [ThrAla] [ThrAlaSer] [PheTyrLeu] Leu [LysArg] (SEQ ID NO: 87) 
if [PheTyr]Leu[LysArg] [AlaSer] ArgLeu[AsnBe]Cys (SEQ ID NO:3 1 ) 

lg [ Asnne]CysSer[neVal] [ProSer]Gly (SEQ ID NO: 32) 

CH Trp[ThrAla] [ThrAlaSer] [PheTyrLeu]LeuLys[AlaSerValIleLeu]XaaLeu (SEQ ID NO:88) 
; Jj Trp[ThrAlaSer] [ThrAlaSer] [PheTyrLeu]LeuLys[AlaSerIleLeu]XaaLeu (SEQ ID NO:89) 
W Trp[ThrAla] [ThrAlaSer]XaaLeuLysXaaXaaLeuXaaCys (SEQ ID NO:90) 

(j) [PheTyr][PheTyr][AsnAsp]GluueGlnSer(SEQIDNO:36) 
2gj [PheTyr]Pro[PheTyr] [PheTyr] [PheTyr] [AsnAsp]Glu (SEQ ID NO: 37) 

J (k) GlySerAla[VaineLeu]CysXaa[PheTyr] (SEQ ID NO: 38) 
% SerAla[ValHe]CysXaa[PheTyr]XaaMet (SEQ ID NO:39) 

Q (1) AsnSer[AsnAla]TrpLeu[ProAla]Val (SEQ ID NO:40) 
1 y (m) [ValLeuIle]Pro[GluAspTyrSerPhe]ProArgProGly (SEQ ID NO:4 1 ) 
25 [ ValLeuIle]ProXaaProArgProGlyXaaCys (SEQ ID NO: 9 1 ) 

Pro[GluAspTyrSerPhe]ProArgProGly[ThrGlnSer]Cys (SEQ ID NO:43) 
(n) AspPro[HisPheTyr]Cys[AlaGly]Trp (SEQ ID NO:44) 
Pro[HisPheTyr]Cys[AlaGly]TrpAsp (SEQ ID NO:45) 
AspProXaaCys[AlaGly]TrpAsp (SEQ ID NO:46) 
30 CysXaaXaaXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:47) 

CysXaaXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:48) 
CysXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:49) 
CysXaaXaaCysXaaXaaXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO:50) 
CysXaaXaaCysXaaXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO:51) 
35 CysXaaXaaCysXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO:52). 

The following 2 classes are preferred peptides which exclude semaphorin peptides 
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encoded in open reading frames of Variola major or Vaccinia viruses Grasshopper 
Semaphorin I. 



(f) TyrPhePhe[PheTyr]ArgGlu (SEQ ID NO: 18) 
5 Asp[LysTyr] Val[PheTyr] [PheTyrLeu][PheTyrIleLeu] [PheTyr] (SEQ ID NO:92) 
Asp[LysTyr]Val[PheTyr] [PheTyrEeLeu] [PheTyrEe] [PheTyr] (SEQ ID NO:93) 
[ValIle]Tyr[PheTyrLeu][PheTyrneLeu]Phe[ArgThr]Xaa[ThrAsn] (SEQ ID NO:94) 
[Vame]Tyr[PheTyrfleLeu][PheTyrfle]Phe[ArgThr]Xaa[ThrAsn] (SEQ ID NO: 95) 
[Vaine]Tyr[PheTyrneLeu][PheTyrneLeu]PheArgXaa[ThrAsn] (SEQ ID NO:96) 
10 Val[PheTyr] [PheTyrLeu] [PheTyrEeLeu] [PheTyr] [ArgThr] [GluAspVal] [ThrAsn] (SEQ ID 
NO:97) 

Val[PheTyr] [PheTyrEeLeu] [PheTyrlle] [PheTyr] [ArgThr] [GluAspVal] [ThrAsn] (SEQ ID 
NO:98) 

U Val[PheTyr] [PheTyrDeLeu] [PheTyrEeLeu] [PheTyr] Arg[GluAsp Val] [ThrAsn] (SEQ ID 
153 NO:99) 

!?{ (n) CysXaaXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:48) 
SI CysXaaXaaAspProXaaCysXaaTrpAsp (SEQ ID NO:49) 

J CysXaaXaaCysXaaXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO: 5 1 ) 

m CysXaaXaaCysXaaXaaAspXaaXaaCysXaaTrpAsp (SEQ ID NO:52). 

2CL 

; "y The following 5 classes include peptides which encompass peptides encoded in open 

Q reading frames of Variola major or Vaccinia viruses. Accordingly, in the event that these 
2 viral peptides are not novel per se, the present invention discloses a hitherto unforseen and 
fy unforseeable utility for these peptides as immunosuppressants and targets of anti-viral 
25 therapy. 

(b)CysGlyThr[AsnGly][AlaSerAsn][TyrPheHisGly][LysArgHisAsnGln] (SEQ ID NO:03) 
CysGlyThr[AsnGly][AlaSerAsn]XaaXaaPro (SEQ ID NO:04) 
CysGlyThr[AsnGly]XaaXaaXaaProXaa[CysAsp] (SEQ ID NO:05) 
30 CysGlyThrXaaXaaXaaXaaProXaa[CysAsp]XaaXaa[TyrEe] (SEQ ED NO:06) 

(f)Asp[LysPheTyr]Val[PheTyr] [PheTyrEeLeu] [PheTyrEeLeu] [PheTyr] (SEQ TO NO:22) 
[ValEe] [PheTyr] [PheTyrEeLeu] [PheTyrEeLeu]Phe[ArgThr]Xaa[ThrAsn] (SEQ ID NO:23) 
Val [PheTyr] [PheTyrEeLeu] [PheTyrEeLeu] [PheTyr] [ArgThr] [GluAspVal] [ThrAsn] (SEQ ID 
NO: 100) 

35 (i)[ArgLysAsn]Trp[ThrAlaSer][ThrAlaSer][PheTyrLeu]Leu[LysArg] (SEQ ID NO:30) 

Trp[ThrAlaSer] [ThrAlaSer] [PheTyrLeu]LeuLys[AlaSerValIleLeu]XaaLeu (SEQ ID 

NO:33) 
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Trp[ThrAlaSer][ThrAlaSer]XaaLeuLysXaaXaaLeuXaaCys (SEQ ID NO: 34) 
TrpXaa[ThrSer]XaaLeuLysXaaXaaLeuXaaCys (SEQ ID NO:35) 
(k) SerAla[VameLeu]CysXaa[PheTyr]XaaMet (SEQ ID NO: 39) 
(m) [ValLeuDe]ProXaaPro[ArgAla]ProGlyXaaCys (SEQ ID NO:42). 

5 

The disclosed semaphorin sequence data are used to define a wide variety of other 
semaphorin- and semaphorin receptor-specific binding agents using immunologic, 
chromatographic or synthetic methods available to those skilled in the art. 

Of particular significance are peptides comprising unique portions of semaphorin- 
10 specific receptors and polypeptides comprising a sequence substantially similar to that of a 

substantially full-length semaphorin receptor. Using semaphorin peptides, these receptors are 
identified by a variety of techniques known to those skilled in the art where a ligand to the 
target receptor is known, including expression cloning as set out in the exemplification 
p below. For other examples of receptor isolation with known ligand using expression cloning, 
ljj see, Staunton et al (1989) Nature 339, 61; Davis et al (1991) Science 253, 59; Lin et al (1992) 
5j Cell 68, 775; Gearing et al (1989) EMBO 8, 3667; Aruffo and Seed (1987) PNAS 84, 8573 

01 and references therein. Generally, COS cells are transfected to express a cDNA library or 

2 J PCR product and cells producing peptides/polypeptides which bind a semaphorin/receptor 

s peptide/polypeptide are isolated. For neurosemaphorin receptors, fetal brain cDNA libraries 
20; are preferred; for immunosemaphorin receptors, libraries derived from activated lymphoid or 
q myeloid cell lines or tissue derived from sites of inflammation or delayed-type 
*P hypersensitivity are preferred; and for semaphorin and semaphorin receptor variants used by 
St tumor cells to evade immune surveillance or suppress an immune response 

(oncosemaphorins), libraries derived from cancerous tissue or tumor cell lines resistant to the 
25 host immune system are preferred. Alternatively, PCR primers based upon known 

semaphorin/receptor sequences such as those disclosed herein are used to amplify PCR 
product from such tissues/cells. Other receptor/ligand isolation methods using immobilized 
ligand or antibody are known to those skilled in the art. 

Semaphorin receptor peptides with receptor binding specificity are identified by a 
30 variety of ways including having conserved consensus sequences with other semaphorin 

receptors, by crosslinking to ligand or receptor-specific antibody, or preferably, by screening 
such peptides for semaphorin binding or disruption of semaphorin -receptor binding. Methods 
for identifying semaphorin receptor peptides with the requisite binding activity are described 
herein or otherwise known to those skilled in the art. By analogous methods, semaphorin 
35 receptor peptides are used to define additional semaphorin peptides with semaphorin binding 
specificity, particularly receptor specificity. 

The various semaphorin and semaphorin receptor peptides are used to define 
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functional domains of semaphores, identify compounds that associate with semaphores, 
design compounds capable of modulating semaphorin-mediated nerve and immune cell 
function, and define additional semaphorin and semaphorin receptor-specific binding agents. 
For example, semaphorin mutants, including deletion mutants are generated from the 

5 disclosed semaphorin sequences and used to identify regions important for specific protein - 
ligand or protein-protein interactions, for example, by assaying for the ability to mediate 
repulsion or preclude aggregation in cell-based assays as described herein. Further, x-ray 
crystallographic data of the disclosed protein are used to rationally design binding molecules 
of determined structure or complementarity for modulating growth cone growth and 

10 guidance. 

Additional semaphorin- and receptor-specific agents include specific antibodies that 
can be modified to a monovalent form, such as Fab, Fab', or Fv, specifically binding 
oligopeptides or oligonucleotides and most preferably, small molecular weight organic 
t receptor antagonists. For example, the disclosed semaphorin and receptor peptides are used 
m as imrnunogens to generate semaphorin- and receptor-specific polyclonal or monoclonal 
ffj antibodies. See, Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring 
f?5 Harbor Laboratory, for general methods. Anti-idiotypic antibody, especially internal imaging 
US anti-ids are also prepared using the disclosures herein. 

iy In addition to semaphorin and semaphorin-receptor derived polypeptides and peptides, 

2D other prospective agents are screened from large libraries of synthetic or natural compounds. 
[H For example, numerous means are available for random and directed synthesis of saccharide, 
]? peptide, and nucleic acid based compounds. Alternatively, libraries of natural compounds in 
3 the form of bacterial, fungal, plant and animal extracts are available or readily producible. 
[ y Additionally, natural and synthetically produced libraries and compounds are readily 
25 modified through conventional chemical, physical, and biochemical means. See, e.g. 

Houghten et al. and Lam et al (1991) Nature 354, 84 and 81, respectively and Blake and Litzi- 
Davis (1992), Bioconjugate Chem 3, 510. 

Useful agents are identified with a range of assays employing a compound comprising 
the subject peptides or encoding nucleic acids. A wide variety of in vitro, cell-free binding 
30 assays, especially assays for specific binding to immobilized compounds comprising 

semaphorin or semaphorin receptor peptide find convenient use. While less preferred, cell- 
based assays may be used to determine specific effects of prospective agents on semaphorin- 
receptor binding may be assayed. Optionally, the intracellular C-terminal domain is 
substituted with a sequence encoding a oligopeptide or polypeptide domain that provides 
35 a detectable intracellular signal upon ligand binding different from the natural receptor. 
Useful intracellular domains include those of the human insulin receptor and the 
TCR, especially domains with kinase activity and domains capable of 
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triggering calcium influx which is conveniently detected by fluorimetry by preloading the 
host cells with Fura-2. More preferred assays involve simple cell-free in vitro binding of 
candidate agents to immobilized semaphorin or receptor peptides, or vice versa. See, e.g. 
Fodor et al (1991) Science 251, 767 for light directed parallel synthesis method. Such assays 
5 are amenable to scale-up, high throughput usage suitable for volume drug screening. 

Useful agents are typically those that bind to a semaphorin or disrupt the association 
of a semaphorin with its receptor. Preferred agents are semaphorin-specific and do not cross 
react with other neural or lymphoid cell membrane proteins. Useful agents may be found 
within numerous chemical classes, though typically they are organic compounds; preferably 
10 small organic compounds. Small organic compounds have a molecular weight of more than 
150 yet less than about 4,500, preferably less than about 1500, more preferably, less than 
about 500. Exemplary classes include peptides, saccharides, steroids, heterocyclics, 
polycyclics, substituted aromatic compounds, and the like. 

Selected agents may be modified to enhance efficacy, stability, pharmaceutical 
ljfg compatibility, and the like. Structural identification of an agent may be used to identify, 
Jjj generate, or screen additional agents. For example, where peptide agents are identified, they 
,j may be modified in a variety of ways as described above, e.g.to enhance their proteolytic 
stability. Other methods of stabilization may include encapsulation, for example, in 
liposomes, etc. 

20 The subject binding agents may be prepared in a variety of ways known to those 

;r; skilled in the art. For example, peptides under about 60 amino acids can be readily 

i y 

q synthesized today using conventional commercially available automatic synthesizers, 
j Alternatively, DNA sequences may be prepared encoding the desired peptide and inserted 
m into an appropriate expression vector for expression in a prokaryotic or eukaryotic host. A 
25 wide variety of expression vectors are available today and may be used in conventional ways 
for transformation of a competent host for expression and isolation. If desired, the open 
reading frame encoding the desired peptide may be joined to a signal sequence for secretion, 
so as to permit isolation from the culture medium. Methods for preparing the desired 
sequence, inserting the sequence into an expression vector, transforming a competent host, 
30 and growing the host in culture for production of the product may be found in U.S. Patent 
Nos. 4,710,473, 4,711,843 and 4,713,339. 

For therapeutic uses, the compositions and agents disclosed herein may be 
administered by any convenient way, preferably parenterally, conveniently in a 
pharmaceutically or physiologically acceptable carrier, e.g., phosphate buffered saline, saline, 
35 deionized water, or the like. Typically, the compositions are added to a retained physiological 
fluid such as blood or synovial fluid. For CNS administration, a variety of techniques are 
available for promoting transfer of the therapeutic across the blood brain barrier including 
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disruption by surgery or injection, drugs which transciently open adhesion contact between 
CNS vasculature endothelial cells, and compounds which fascilitate translocation through 
such cells. As examples, many of the disclosed therapeutics are amenable to directly injected 
or infused, contained within implants e.g. osmotic pumps, grafts comprising appropriately 
5 transformed cells. Generally, the amount administered will be empirically determined, 
typically in the range of about 10 to 1000 |J.g/kg of the recipient. For peptide agents, the 
concentration will generally be in the range of about 50 to 500 fig/ml in the dose 
administered. Other additives may be included, such as stabilizers, bactericides, etc. These 
additives will be present in conventional amounts. 
10 The invention provides isolated nucleic acid sequences encoding the disclosed 

semaphorin and semaphorin receptor peptides and polypeptides, including sequences 
substantially identical to sequences encoding such polypeptides. An "isolated" nucleic acid 
sequence is present as other than a naturally occurring chromosome or transcript in its natural 
state and typically is removed from at least some of the nucleotide sequences with which it is 
ljg normally associated with on a natural chromosome. A complementary sequence hybridizes to 
y a unique portion of the disclosed semaphorin sequence under low stringency conditions, for 
Cj example, at 50°C and SSC (0.9 M saline/0.09 M sodium citrate) and that remains bound 
Q n when subject to washing at 55 °C with SSC. Regions of non-identity of complementary 
JjfJ nucleic acids are preferably or in the case of homologous nucleic acids, a nucleotide change 
20 providing a redundant codon. A partially pure nucleotide sequence constitutes at least about 
;;r! 5%, preferably at least about 30%, and more preferably at least about 90% by weight of total 
q nucleic acid present in a given fraction. 

=F Unique portions of the disclosed nucleic acid sequence are of length sufficient to 

distinguish previously known nucleic acid sequences. Thus, a unique portion has a 
25 nucleotide sequence at least long enough to define a novel oligonucleotide. Preferred nucleic 
acid portions encode a unique semaphorin peptide. The nucleic acids of the invention and 
portions thereof, other than those used as PCR primers, are usually at least about 60 bp and 
usually less than about 60 kb in length. PCR primers are generally between about 15 and 100 
nucleotides in length. 

30 Nucleotide (cDNA) sequences encoding several full length semaphorins are disclosed 

herein. The invention also provides for the disclosed sequences modified by transitions, 
transversions, deletions, insertions, or other modifications such as alternative splicing and 
also provides for genomic semaphorin sequences, and gene flanking sequences, including 
regulatory sequences; included are DNA and RNA sequences, sense and antisense. Preferred 

35 DNA sequence portions include portions encoding the preferred amino acid sequence 

portions disclosed above. For antisense applications where the inhibition of semaphorin 
expression is indicated, especially useful oligonucleotides are between about 10 and 30 
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nucleotides in length and include sequences surrounding the disclosed ATG start site, 
especially the oligonucleotides defined by the disclosed sequence beginning about 5 
nucleotides before the start site and ending about 10 nucleotides after the disclosed start site. 
Other especially useful semaphorin mutants involve deletion or substitution modifications of 

5 the disclosed cytoplasmic C-termini of transmembrane semaphorins. Accordingly, 

semaphorin mutants with semaphorin binding affinities but with altered intracellular signal 
transduction capacities are produced. 

For modified semaphorin-encoding sequences or related sequences encoding proteins 
with semaphorin-like functions, there will generally be substantial sequence identity between 
10 at least a segment thereof and a segment encoding at least a portion of the disclosed 

semaphorin sequence, preferably at least about 60%, more preferably at least 80%, most 
preferably at least 90% identity. Homologous segments are particularly within semaphorin 
domain-encoding regions and regions encoding protein domains involved in protein-protein, 

I =4 particularly semaphorin-receptor interactions and differences within such segments are 

K5j particularly conservative substitutions. 

□ 

S= Typically, the invention's semaphorin peptide encoding polynucleotides are associated 

SJ with heterologous sequences. Examples of such heterologous sequences include regulatory 
~J sequences such as promoters, enhancers, response elements, signal sequences, 
HI polyadenylation sequences, etc., introns, 5' and 3' noncoding regions, etc. Other useful 
20 heterologous sequences are known to those skilled in the art or otherwise disclosed references 

cited herein. According to a particular embodiment of the invention, portions of the 
□ semaphorin encoding sequence are spliced with heterologous sequences to produce soluble, 
J secreted fusion proteins, using appropriate signal sequences and optionally, a fusion partner 
tj such as P-Gal. 

25 The disclosed sequences are also used to identify and isolate other natural 

semaphorins and analogs. In particular, the disclosed nucleic acid sequences are used as 
hybridization probes under low-stringency or PCR primers, e.g. oligonucleotides encoding 
functional semaphorin domains are 32 P-labeled and used to screen A,cDNA libraries at low 
stringency to identify similar cDNAs that encode proteins with related functional domains. 

30 Additionally, nucleic acids encoding at least a portion of the disclosed semaphorin are used to 
characterize tissue specific expression of semaphorin as well as changes of expression over 
time, particularly during organismal development or cellular differentiation. 

The semaphorin encoding nucleic acids can be subject to alternative purification, 
synthesis, modification, sequencing, expression, transfection, administration or other use by 

35 methods disclosed in standard manuals such as Molecular Cloning, A Laboratory Manual 
(2nd Ed., Sambrook, Fritsch and Maniatis, Cold Spring Harbor), Current Protocols in 
Molecular Biology (Eds. Aufubel, Brent, Kingston, More, Feidman, Smith and Stuhl, Greene 
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Publ. Assoc., Wiley-Interscience, NY, NY, 1992) or that are otherwise known in the art. For 
example, the nucleic acids can be modified to alter stability, solubility, binding affinity and 
specificity, etc. semaphorin-encoding sequences can be selectively methylated, etc. The 
nucleic acid sequences of the present invention may also be modified with a label capable of 
5 providing a detectable signal, either directly or indirectly. Exemplary labels include 
radioisotopes, fluorescers, biotinylation, etc. 

The invention also provides vectors comprising nucleic acids encoding semaphorin 
peptides, polypeptides or analogs. A large number of vectors, including plasmid and viral 
vectors, have been described for expression in a variety of eukaryotic and prokaryotic hosts. 
10 Advantageously, vectors may also include a promotor operably linked to the semaphorin- 
encoding portion. Vectors will often include one or more replication systems for cloning or 
expression, one or more markers for selection in the host, e.g. antibiotic resistance. The 
inserted semaphorin coding sequences may be synthesized, isolated from natural sources, 
i , prepared as hybrids, etc. Suitable host cells may be transformed/transfected/infected by any 
lffi suitable method including electroporation, CaCl 2 mediated DNA uptake, viral infection, 
2? microinjection, microprojectile, or other methods. 

\| Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, and 

0' 1 plant and animal cells, especially mammalian cells. Of particular interest are E. coli . B. 
T subtilis . Saccharomvces cerevisiae . SF9 cells, C129 cells, 293 cells, Neurospora, and CHO, 
20 COS, HeLa cells, immortalized mammalian myeloid and lymphoid cell lines, and pluripotent 
l^j cells, especially mammalian ES cells and zygotes. Preferred replication systems include 
p M13, ColEl, SV40, baculovirus, lambda, adenovirus, AAV, BPV, etc. A large number of 
jH transcription initiation and termination regulatory regions have been isolated and shown to be 
« | effective in the transcription and translation of heterologous proteins in the various hosts. 
25 Examples of these regions, methods of isolation, manner of manipulation, etc. are known in 
the art. Under appropriate expression conditions, host cells can be used as a source of 
recombinantly produced semaphorins or analogs. 

For the production of stably transformed cells and transgenic animals, nucleic acids 
encoding the disclosed semaphorins may be integrated into a host genome by recombination 
30 events. For example, such a sequence can be microinjected into a cell, and thereby effect 
homologous recombination at the site of an endogenous gene, an analog or pseudogene 
thereof, or a sequence with substantial identity to an semaphorin-encoding gene. Other 
recombination-based methods such as nonhomologous recombinations, deletion of 
endogenous gene by homologous recombination, especially in pluripotent cells, etc., provide 
35 additional applications. Preferred transgenics and stable transformants over-express the 
disclosed receptor gene and find use in drug development and as a disease model. 
Alternatively, knock-out cells and animals find use in development and functional studies. 



16 



B94-002-8 



Methods for making transgenic animals, usually rodents, from ES cells or zygotes are known 
to those skilled in the art. 

The compositions and methods disclosed herein may be used to effect gene therapy. 
See, e.g. Zhu et al. (1993) Science 261, 209-211; Gutierrez et al. (1992) Lancet 339, 715-721 . 
5 For example, cells are transfected with semaphorin sequences operably linked to gene 

regulatory sequences capable of effecting altered semaphorin expression or regulation. To 
modulate semaphorin translation, cells may be transfected with complementary antisense 
polynucleotides. For gene therapy involving the transfusion of semaphorin transfected cells, 
administration will depend on a number of variables that are ascertained empirically. For 
10 example, the number of cells will vary depending on the stability of the transfused cells. 
Transfusion media is typically a buffered saline solution or other pharmacologically 
acceptable solution. Similarly the amount of other administered compositions, e.g. 
transfected nucleic acid, protein, etc., will depend on the manner of administration, purpose 
of the therapy, and the like. 



The following examples are offered by way of illustration and not by way of 
limitation. 



1 

m 

jJt EXAMPLES 
20 

Jr! I. Isolation and characterization of Grasshopper Semaphorin I (SEP ID NOS: 57 and 
Q 58(previouslv referred to as Fasciclin IV) 

i 

i^jj In order to identify cell surface molecules that function in selective fasciculation, a 

25 series of monoclonal antibody (MAb) screens was conducted. The immunogen used for most 
of these screens was membranes from the longitudinal connectives (the collection of 
longitudinal axons) between adjacent segmental ganglia of the nervous system of the larval 
grasshopper. From these screens, MAb 3B1 1 and 8C6 were used to purify and characterize 
two surface glycoproteins, fasciclin I and fasciclin n, see, Bastiani et al., 1987; the genes 
30 encoding both were subsequently cloned, see, Snow et al. 1989, Zinn et al. 1988, and 
Harrelson and Goodman, 1988. 

Another MAb isolated during these screens, MAb 6F8, was chosen for the present 
study because, just as with fasciclin I and fasciclin n, the antigen recognized by this MAb is 
expressed on a different but overlapping subset of axon pathways in the developing CNS. 
35 The 6F8 antigen appears to be localized on the outside of cell surfaces, as indicated by MAb 
binding when incubated both in live preparations, and in fixed preparations in which no 
detergents have been added. Because the 6F8 antigen is a surface glycoprotein expressed on 
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a subset of axon fascicles (see below), we call it fasciclin IV. 

Fasciclin IV expression begins early in embryonic development before 
axonogenesis. At 29% of development, expression is seen on the surface of the midline 
mesectodermal cells and around 5-7 neuroblasts and associated ectodermal cells per 
5 hemisegment. This expression is reminiscent of the mesectodermal and neuroblast-associated 
expression observed with both fasciclin I and fasciclin II; however, in each case, the pattern 
resolves into a different subset of neuroblasts and associated ectodermal cells. 

At 32% of development, shortly after the onset of axonogenesis in the CNS, 
fasciclin IV expression is seen on the surface of the axons and cell bodies of the three pairs of 
10 MP4, MP5, and MP6 midline progeny, the three U motoneurons, and on several unidentified 
neurons in close proximity to the U's. This is in contrast to fasciclin n, which at this stage is 
expressed on the MP1 and dMP2 neurons, and fasciclin I, which is expressed on the U 
neurons but not on any midline precursor progeny. 
, The expression of fasciclin IV on a subset of axon pathways is best observed around 

liSj 40% of development, after the establishment of the first longitudinal and commissural axon 
^ pathways . At this stage, the protein is expressed on two longitudinal axon fascicles, a subset 
%4 of commissural axon fascicles, a tract extending anteriorly along the midline, and a subset of 
f?5 fascicles in the segmental nerve (SN) and intersegmental nerve (ISN) roots. 
"ffi Specifically, fasciclin IV is expressed on the U fascicle, a longitudinal pathway 

20 (between adjacent segmental neuromeres) pioneered in part by the U neurons, and on the A/P 
J; longitudinal fascicle (in part an extension of the U fascicle within each segmental 
Q neuromere). In addition, fasciclin IV is also expressed on a second narrower, medial, and 
•J more ventral longitudinal pathway. The U axons turn and exit the CNS as they pioneer the 
fU ISN; the U's and many other axons within the ISN express fasciclin IV. The continuation of 
25 the U fascicle posterior to the ISN junction is also fasciclin IV-positive. The specificity of 
fasciclin IV for distinct subsets of longitudinal pathways can be seen by comparing fasciclin 
IV and fasciclin II expression in the same embryo; fasciclin IV is expressed on the U and A/P 
pathways whereas fasciclin II is expressed on the MP1 pathway. 

The axons in the median fiber tract (MFT) also express fasciclin IV. The MFT is 
30 pioneered by the three pairs of progeny of the midline precursors MP4, MP5, and MP6. The 
MFT actually contains three separate fascicles. The axons of the two MP4 progeny pioneer 
the dorsal MFT fascicle and then bifurcate at the posterior end of the anterior commissure; 
whereas the axons of the two MP6 progeny pioneer the ventral MFT fascicle and then 
bifurcate at the anterior end of the posterior commissure. Fasciclin IV is expressed on the cell 
35 bodies of the six MP4, MP5, and MP6 neurons, and on their growth cones and axons as they 
extend anteriorly in the MFT and bifurcate in one of the two commissures. However, this 
expression is regional in that once these axons bifurcate and begin to extend laterally across 
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the longitudinal pathways and towards the peripheral nerve roots, their expression of fasciclin 
IV greatly decreases. Thus, fasciclin IV is a label for the axons in the MFT and their initial 
bifurcations in both the anterior and posterior commissures. It appears to be expressed on 
other commissural fascicles as well. However, the commissural expression of fasciclin IV is 
5 distinct from the transient expression of fasciclin II along the posterior edge of the posterior 
commissure, or the expression of fasciclin I on several different commissural axon fascicles 
in both the anterior and posterior commissure (Bastiani et al., 1987; Harrelson and Goodman, 
1988). 

Fasciclin IV is also expressed on a subset of motor axons exiting the CNS in the SN. 
10 The SN splits into two major branches, one anterior and the other posterior, as it exits the 

CNS. Two large bundles of motoneuron axons in the anterior branch express fasciclin IV at 
high levels; one narrow bundle of motoneuron axons in the posterior branch expresses the 
protein at much lower levels. Fasciclin IV is also expressed on many of the axons in the ISN. 

The CNS and nerve root expression patterns of fasciclin IV, fasciclin I, and fasciclin II 
IIS at around 40% of embryonic development are summarized below. Although there is some 
* . overlap in their patterns (e.g., both fasciclin IV and fasciclin I label the U axons), these three 
op surface glycoproteins label distinct subsets of axon pathways in the developing CNS. 
Uf 
PU 

Fasciclin IV is expressed on epithelial bands in the developing limb bud 

201 Fasciclin IV is expressed on the developing limb bud epithelium in circumferential 

^ bands; at 34.5% of development these bands can be localized with respect to constrictions in 
i: :g the epithelium that mark presumptive segment boundaries. In addition to a band just distal to 
y the trochanter/coxa segment boundary, bands are also found in the tibia, femur, coxa, and 

later in development a fifth band is found in the tarsus. Fasciclin IV is also expressed in the 
25 nascent chordotonal organ in the dorsal aspect of the femur. The bands in the tibia, 

trochanter, and coxa completely encircle the limb. However, the femoral band is incomplete, 
containing a gap on the anterior epithelia of this segment. 

The position of the Ti 1 axon pathway with respect to these bands of fasciclin IV- 
positive epithelia suggests a potential role for fasciclin IV in guiding the Til growth cones. 
30 First, the band of fasciclin IV expression in the trochanter, which is approximately three 
epithelial cell diameters in width when encountered by the Til growth cones, is the axial 
location where the growth cones reorient from proximal migration to circumferential branch 
extension. The Trl cell, which marks the location of the turn, lies within this band, usually 
over the central or the proximal cell tier. Secondly, although there is a more distal fasciclin 
35 IV expressing band in the femur, where a change in Til growth is not observed, there exists a 
gap in this band such that fasciclin IV expressing cells are not traversed by the Ti 1 growth 
cones. The Til axons also may encounter a fasciclin IV expressing region within the coxa, 
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where interactions between the growth cones, the epithelial cells, and the Cxi guidepost cells 
have not yet been investigated. In addition to its expression over the surface of bands of 
epithelial cells, fasciclin IV protein, as visualized with MAb 6F8, is also found on the basal 
surface of these cells in a punctate pattern. This punctate staining is not an artifact of the 
5 HRP immunocytochemistry since fluorescent visualization of MAb 6F8 is also punctate. The 
non-neuronal expression of fasciclin IV is not restricted to limb buds. Circumferential 
epithelial bands of fasciclin IV expression are also seen on subesophageal mandibular 
structures and on the developing antennae. 

10 MAb directed against fasciclin IV can alter the formation of the Til axon pathway in 
the limb bud 

The expression of fasciclin IV on an epithelial band at a key choice point in the 
formation of the Til axon pathway led us to ask whether this protein is involved in growth 
y, cone guidance at this location. To answer this question, we cultured embryos, or epithelial 
153 fillets (e. g., O'Connor et al., 1990), during the 5% of development necessary for normal 
U pathway formation, either in the presence or absence of MAb 6F8 or 6F8 Fab fragments. 
%| Under the culture conditions used for these experiments, defective Til pathways are observed 
'fl in 14% of limbs (Chang et al, 1992); this defines the baseline of abnormalities observed 
Sj using these conditions. For controls we used other MAbs and their Fab fragments that either 
20 bind to the surfaces of these neurons and epithelial cells (MAb 3B 1 1 against the surface 
51 protein fasciclin I) or do not (MAb 4D9 against the nuclear protein engrailed; Patel et al., 
Q 1989). To assess the impact of MAb 6F8 on Til pathway formation, we compared the 
;5 percentage of aberrant pathways observed following treatment with MAb 6F8 to that 
r-y observed with MAbs 3B 1 1 and 4D9. Our cultures began at 32% of development when the 
25 Til growth cones have not yet reached the epithelium just distal to the trochanter/coxa 
boundary and therefore have not encountered epithelial cells expressing fasciclin IV. 
Following approximately 30 hours in culture (-4% of development), embryos were fixed and 
immunostained with antibodies to HRP in order to visualize the Til axons and other neurons 
in the limb bud. Criteria for scoring the Til pathway, and the definition of "aberrant", are 
30 described in detail in the Experimental Procedures. 

Although MAb 6F8 does not arrest pathway formation, several types of distinctive, 
abnormal pathways are observed. These defects generally begin where growth cones first 
contact the fasciclin IV expressing cells in the trochanter. Normally, the Til neurons each 
have a single axon, and the axons of the two cells are fasciculated in that portion of the 
35 pathway within the trochanter. Following treatment with MAb 6F8, multiple long axon 
branches are observed within, and proximal to, the trochanter. Two major classes of 
pathways are taken by these branches; in 36% of aberrant limbs, multiple, long axon branches 
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extend ventrally in the region distal to the Cxi cells which contains the band of fasciclin IV 
expressing epithelial cells. In the ventral region of the trochanter, these branches often 
independently turn proximally to contact the Cxi cells, and thus complete the pathway in this 
region. 

5 In the second major class of pathway defect, seen in 47% of aberrant limbs, axon 

branches leave the trochanter at abnormal, dorsal locations, and extend proximally across the 
trochanter/coxa boundary. These axons then veer ventrally, often contacting the Cx 1 
neurons. The remaining 17% of defects include defasciculation distal to the trochanter, axon 
branches that fail to turn proximally in the ventral trochanter and continue into the posterior 
10 compartment of the limb, and axon branches which cross the trochanter/coxa boundary and 
continue to extend proximally without a ventral turn. 

When cultured in the presence of MAb 6F8, 43% of limbs exhibited malformed Til 
pathways (n = 381) as compared to 1 1% with MAb 3B1 1 (n = 230) and 5% with MAb 4D9 (n 
= 20). These percentages are pooled from treatments with MAbs concentrated from 
IS* hybridoma supernatant, IgGs isolated from these supernatants, and Fab fragments isolated 
O from these IgG preparations (see Experimental Procedures). The frequency of malformed Ti 1 
Zj pathways and the types of defects observed showed no significant variation regardless of the 
GP. method of antibody preparation or type of antibody used. Since Fabs show similar results as 
r; IgGs, the effects of MAb 6F8 are not due to cross linking by the bivalent IgG. 
20 In summary, following treatment with MAb 6F8, the Til pathway typically exhibits 

y abnormal morphology beginning just distal to the trochanter and at the site of fasciclin IV 
□ expression. The two most common types of Til pathway defects described above occur in 
J= 36% of experimental limbs (treated with MAb 6F8), but are seen in only 4% of control limbs 
;J (treated with MAbs 3 B 1 1 and 4D9). 
25* 

Fasciclin IV cDNAs encode a novel integral membrane protein 

Grasshopper fasciclin IV was purified by passing crude embryonic grasshopper lysates 
over a MAb 6F8 column. After affinity purification, the protein was eluted, precipitated, 
denatured, modified at cysteines, and digested with either trypsin or Lys-C. Individual 

30 peptides were resolved by reverse phase HPLC and microsequenced using standard methods. 

The amino acid sequences derived from these proteolytic fragments were used to 
generate oligonucleotide probes for PCR experiments, resulting in products that were used to 
isolate cDNA clones from the Zinn embryonic grasshopper cDNA library (Snow et al., 1988). 
Sequence analysis of these cDNAs reveals a single open reading frame (ORF) encoding a 

35 protein with two potential hydrophobic stretches of amino acids: an amino-terminal signal 

sequence of 20 residues and (beginning at amino acid 627) a potential transmembrane domain 
of 25 amino acids. Thus, the deduced protein has an extracellular domain of 605 amino 
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acids, a transmembrane domain, and a cytoplasmic domain of 78 amino acids. The calculated 
molecular mass of the mature fasciclin IV protein is 80 kd and is confirmed by Western blot 
analysis of the affinity purified and endogenous protein as described below. The extracellular 
domain of the protein includes 16 cysteine residues that fall into three loose clusters but do 
5 not constitute a repeated domain and are not similar to other known motifs with cysteine 
repeats. There are also six potential sites for N-linked glycosylation in the extracellular 
domain. Treatment of affinity purified fasciclin IV with N-Glycanase demonstrates that 
fasciclin IV does indeed contain N-linked oligosaccharides. Fasciclin IV shows no sequence 
similarity when compared with other proteins in the PIR data base using BLASTP (Altschul 
10 et al., 1990), and is therefore a novel type I integral membrane protein. 

A polyclonal antiserum directed against the cytoplasmic domain of the protein 
encoded by the fasciclin IV cDNA was used to stain grasshopper embryos at 40% of 
development. The observed staining pattern was identical to that seen with MAb 6F8. On 
LA Western blots, this antiserum recognizes the protein we affinity purified using MAb 6F8 and 
1® then subjected to microsequence analysis. Additionally, the polyclonal serum recognizes a 
jz! protein of similar molecular mass from grasshopper embryonic membranes. Taken together 
S| these data indicate that the sequence we have obtained is indeed fasciclin IV. 

Four other cell surface proteins that label subsets of axon pathways in the insect 
A] nervous system (fasciclin I, fasciclin n, fasciclin in, and neuroglian) are capable of mediating 
2© homophilic cell adhesion when transfected into S2 cells in vitro (Snow et al., 1989; Elkins et 
al., 1990b; Grenningloh et al., 1990). To ask whether fasciclin IV can function as a 
Q homophilic cell adhesion molecule, the fasciclin IV cDNA with the complete ORF was 
+; placed under the control of the inducible metallothionein promoter (Bunch et al., 1988), 
py transfected into S2 cells, and assayed for its ability to promote adhesion in normally non- 
25 adhesive S2 cells. Following induction with copper, fasciclin IV was synthesized in these S2 
cells as shown by Western blot analysis and cell surface staining of induced S2 cells with the 
polyclonal antiserum described above. 

We observed no evidence for aggregation upon induction of fasciclin IV expression, 
thus suggesting that, in contrast to the other four proteins, fasciclin IV does not function as a 
30 homophilic cell adhesion molecule. Alternatively, fasciclin IV-mediated aggregation might 
require some further posttranslational modification, or co-factor, not supplied by the S2 cells, 
but clearly this protein acts differently in the S2 cell assay than the other four axonal 
glycoproteins previously tested. This is consistent with the pattern of fasciclin IV expression 
in the embryonic limb since only the epithelial cells and not the Til growth cones express 
35 fasciclin IV, and yet antibody blocking experiments indicate that fasciclin IV functions in the 
epithelial guidance of these growth cones. Such results suggest that fasciclin IV functions in 
a heterophilic adhesion or signaling system. 
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Discussion 

Fasciclin IV is expressed on groups of axons that fasciculate in the CNS, suggesting 
that, much like other insect axonal glycoproteins, it functions as a homophilic cell adhesion 
molecule binding these axons together. Yet, in the limb bud, fasciclin IV is expressed on a 
5 band of epithelium but not on the growth cones that reorient along this band, suggesting a 
heterophilic function. That fasciclin IV functions in a heterophilic rather than homophilic 
fashion is supported by the lack of homophilic adhesion in S2 cell aggregation assays. In 
contrast, fasciclin I, fasciclin II, fasciclin HI, and neuroglian all can function as homophilic 
cell adhesion molecules (Snow et al., 1989; Elkins et al., 1990b; Grenningloh et al., 1990). 
10 cDNA sequence analysis indicates that fasciclin IV is an integral membrane protein 

with a novel sequence not related to any protein in the present data base. Thus, fasciclin IV 
represents a new type of protein that functions in the epithelial guidance of pioneer growth 
cones in the developing limb bud. Given its expression on a subset of axon pathways in the 
u developing CNS, fasciclin IV functions in the guidance of CNS growth cones as well, 
lfi The results from the MAb blocking experiments illuminate several issues in Til 

y growth cone guidance and axon morphogenesis in the limb. First, the most striking change in 
\j growth cone behavior in the limb is the cessation of proximal growth and initiation of 
Cm circumferential extension of processes upon encountering the trochanter/coxa boundary 
?y region (Bentley and Caudy, 1983; Caudy and Bentley, 1987). This could be because the band 
20 of epithelial cells within the trochanter promotes circumferential growth, or because the cells 
~j comprising the trochanter/coxa boundary and the region just proximal to it are non- 
□ permissive or aversive for growth cone migration, or both. The extension of many axon 

branches across the trochanter/coxa boundary following treatment with MAb 6F8 suggests 
m that the trochanter/coxa boundary cells, which do not express fasciclin IV, are not aversive or 
25 non-permissive. Thus the change in behavior at the boundary appears to be due to the ability 
of fasciclin IV expressing epithelial cells to promote circumferential extension of processes 
from the Til growth cones. 

Secondly, treatment with MAb 6F8 results in frequent defasciculation of the axons of 
the two Til neurons, and also formation of abnormal multiple axon branches, within the 
30 trochanter over fasciclin IV-expressing epithelial cells. Previous studies have shown that 

treatment with antibodies against ligands expressed on non-neural substrates (Landmesser et 
al., 1988), or putative competitive inhibitors of substrate ligands (Wang and Denburg, 1992) 
can promote defasciculation and increased axonal branching. Our results suggest that Ti 1 
axomaxon fasciculation and axon branching also are strongly influenced by interactions with 
35 substrate ligands, and that fasciclin IV appears to be a component of this interaction within 
the trochanter. 

Thirdly, despite the effects of MAb 6F8 on axon branching, and on crossing the 
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trochanter/coxa boundary, there remains a pronounced tendency for branches to grow 
ventrally both within the trochanter and within the distal region of the coxa. Consequently, 
all signals which can promote ventral migration of the growth cones have not been blocked 
by MAb 6F8 treatment. Antibody treatment may have a threshold effect in which ventral 
5 growth directing properties of fasciclin IV are more robust, and less incapacitated by 
treatment, than other features; alternatively, guidance information promoting ventral 
migration may be independent of fasciclin IV. Time lapse video experiments to determine 
how the abnormal pathways we observe actually form can resolve these issues. 

These results demonstrate that fasciclin IV functions as a guidance cue for the Ti 1 

10 growth cones just distal to the trochanter/coxa boundary, is required for these growth cones to 
stop proximal growth and spread circumferentially, and that the function of fasciclin IV in 
Til pathway formation result from interactions between a receptor/ligand on the Til growth 
cones and fasciclin IV on the surface of the band of epithelial cells results in changes in 
u growth cone morphology and subsequent reorientation. Fasciclin IV appears to elicit this 

l|;i change in growth cone morphology and orientation via regulation of adhesion, a signal 

y transduction function, or a combination of the two. 

yi 

%f 

y t Experimental Procedures 

i 

20 Immunocytochemistry 

;y! Grasshopper embryos were obtained from a colony maintained at the U.C. Berkeley 

p and staged by percentage of total embryonic development (Bentley et al., 1979). Embryos 
£ were dissected in PBS, fixed for 40 min in PEM-FA [0.1 M PIPES (pH6.95), 2.0 mM EGTA, 
Sfj 1.0 mM MgS0 4 , 3.7% formaldehyde], washed for 1 hr with three changes in PBT (lx PBS, 
25 0.5% Triton X-100, 0.2% BSA), blocked for 30 min in PBT with 5% normal goat serum, and 
incubated overnight at 4°C in primary antibody. PBSap (lx PBS, 0.1% Saponin, o.2% BSA) 
was used in place of PBT with MAb 8G7. Antibody dilutions were as follows: MAb 6F8 1:1, 
polyclonal antisera directed against a fasciclin IV bacterial fusion protein (#98-3) 1:400; MAb 
8G7 1:4; MAb 8C6 1:1. The embryos were washed for one hour in PBT with three changes, 
30 blocked for 30 min, and incubated in secondary antibody for at least 2 hr at room temperature. 
The secondary antibodies were HRP-conjugated goat anti-mouse and anti-rat IgG (Jackson 
Immunoresearch Lab), and were diluted 1:300 . Embryos were washed in PBT for one hour 
with three changes and then reacted in 0.5% diaminobenzidine (DAB) in PBT. The reaction 
was stopped with several washes in PBS and the embryos were cleared in a glycerol series 
35 (50%, 70%, 90%), mounted and viewed under Nomarski or bright field optics. For double- 
labelled preparations the first HRP reaction was done in PBT containing 0.06% NiCl, 
followed by washing, blocking, and incubation overnight in the second primary antibody. 
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The second antibody was visualized with a DAB reaction as described above. Embryos 
cultured in the presence of monoclonal antibodies were fixed and incubated overnight in goat 
anti-HRP (Jackson Immunoresearch Labs) conjugated to RITC (Molecular Probes), washed 
for one hour in PBT with three changes, mounted in 90% glycerol, 2.5% DAB CO 
5 (Polysciences), and viewed under epifluorescence. S2 cells were stained with polyclonal sera 
#98-3 diluted 1:400 and processed as described previously (Snow et al, 1989). 

Monoclonal Antibody Blocking Experiments 

In order to test for functional blocking, monoclonal antibody reagents were prepared 
10 as follows. Hybridoma supernatant was brought to 20% with H 2 0-saturated (NH 4 ) 2 S0 4 , 

incubated in ice 1 hr, and spun at 15,000 g at 4°C for 20 min. The supernatant was brought to 
56% with H 2 0-saturated (NH 4 ) 2 S0 4 , incubated overnight at 4°C, spun as above. The pellet 
was resuspended in PBS using approximately 1/40 volume of the original hybridoma 
supernatant (often remaining a slurry) and dialyzed against lx PBS overnight at 4°C with two 
l|j changes. This reagent is referred to as "concentrated hybridoma supernatant." Purified IgG 
Q was obtained by using Immunopure Plus Immobilized Protein A IgG Purification Kit (Pierce) 
Zj to isolate IgG from the concentrated hybridoma supernatant. Fab fragments were obtained 
CP using the ImmunoPure Fab Preparation Kit (Pierce) from the previously isolated IgGs. For 
J! blocking experiments each reagent was diluted into freshly made supplemented RPMI culture 
20 media (O'Connor et al., 1990) and dialyzed overnight at 4°C against 10 volumes of the same 
;~; culture media. Dilutions were as follows: concentrated hybridoma supernatant 1:4; purified 
q IgG 150mg/ml; Fab 75mg/ml. 

4? Embryos for culture experiments were carefully staged to between 31 and 32% of 

pi development. As embryos in each clutch typically differ by less that 1% of embryonic 

25 development from each other, the growth cones of the Ti 1 neurons at the beginning of the 

culture period were located approximately in the mid-femur, well distal to the trochanter/coxa 
segment boundary. From each clutch at least two limbs were filleted and the Til neurons 
labelled with the lipophilic dye Di I (Molecular Probes) as described (O'Connor et al., 1 990) 
in order to confirm the precise location of the Til growth cones. Prior to culturing, embryos 

30 were sterilized and dissected (Chang et al., 1992). The entire amnion and dorsal membrane 
was removed from the embryo to insure access of the reagents during culturing. Embryos 
were randomly divided into groups and cultured in one of the blocking reagents described 
above. Cultures were incubated with occasional agitation at 30°C for 30 hrs. At the end of 
the culture period embryos were fixed and processed for analysis as described above in 

35 immunocytochemistry. 

For each culture experiment, the scoring of the Til pathway in each limb was 
confirmed independently by a second observer. There was no statistically significant 
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variation between the two observers. Limbs from MAb cultured embryos were compared to 
representative normal limbs from non-MAb cultured embryos and were scored as abnormal if 
any major deviation from the normal Til pathway was observed. The Til pathway was 
scored as abnormal for one or more of the following observed characteristics: (1) 
5 defasciculation for a minimum distance of approximately 25 mm anywhere along the 
pathway, (2) multiple axon branches that extended ventrally within the trochanter, (3) 
presence of one or more axon branches that crossed the trochanter/coxa boundary dorsal to 
the Cxi cells, but then turned ventrally in the coxa and contacted the Cxi cells, (4) the 
presence of axon branches that crossed the trochanter/coxa segment boundary, did not turn 
10 ventrally, but continued proximally toward the CNS, and (5) failure of ventrally extended 
axons within the trochanter to contact and reorient proximally to the Cx 1 cells. For each 
MAb tested, the data are presented as a percentage of the abnormal Til pathways observed. 

y. Protein Affinity Purification and Microsequencing 

lfi Grasshopper fasciclin IV was purified by passing crude embryonic grasshopper lysate 

^ (Bastiani et al., 1987) over an Affi-Gel 15 column (Bio Rad) conjugated with the monoclonal 

SI antibody 6F8. Protein was eluted with 50 mM DEA (pH 1 1 .5), 0. 1 % Lauryldimethylamine 

; J ) oxide (Cal Bio Chem), and ImM EDTA. Protein was then precipitated, denatured, modified 

y at cysteines, and digested with either trypsin or Lys-C (Boehringer-Mannheim). Individual 

20 peptides were resolved by RP-HPLC and micro sequenced (Applied Biosystems 477 1 
Microsequencer) using standard chemistry. 



PCR Methods 

Q 

DNA complementary to poly(A)+ RNA from 45%-50% grasshopper embryos was 
25 prepared (Sambrook et al., 1989). PCR was performed using Perkin Elmer Taq polymerase 
(Saiki et al., 1988), and partially degenerate (based on grasshopper codon bias) 
oligonucleotides in both orientations corresponding to a portion of the protein sequence of 
several fasciclin TV peptides as determined by microsequencing. These oligonucleotides were 
designed so as not to include all of the peptide-derived DNA sequence, leaving a remaining 
30 9-12 base pairs that could be used to confirm the correct identity of amplified products. All 
possible combinations of these sequences were tried. 40 cycles were performed, the 
parameters of each cycle as follows: 96° for one min; a sequentially decreasing annealing 
temperature (2°C/cycle, starting at 65°C and ending at 55°C for remaining 35 cycles) for 1 
min; and at 72°C for one min. Reaction products were cloned into the Sma site of Ml 3 
35 mplO and sequenced. Two products, 1074 bp and 288 bp in length, contained DNA 3' to the 
oligonucleotide sequences encoded the additional amino acid sequence of the fasciclin IV 
peptide from which the oligonuceotides were derived. 
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cDNA Isolation and Sequence Analysis 

Both PCR products were used to screen 1 X 10 6 clones from a grasshopper embryonic 
cDNA library (Snow et al., 1988). 21 clones that hybridized to both fragments were 
recovered, and one 2600 bp clone was sequenced using the dideoxy chain termination 

5 method (Sanger et al., 1977) and Sequenase (US Biochemical Corp.). Templates were made 
from M13 mplO vectors containing inserts generated by sonication of plasmid clones. One 
cDNA was completely sequenced on both strands using Oligonucleotides and double strand 
sequencing of plasmid DNA (Sambrook et al., 1989) to fill gaps. Two additional cDNAs 
were analyzed by double strand sequencing to obtain the 3' 402 bp of the transcript. All three 
10 cDNAs were used to construct a plasmid containing the entire transcript. The complete 
transcript sequence is 2860 bp in length with 452 bp of 5' and 217 bp of 3' untranslated 
sequences containing stop codons in all reading frames. The predicted protein sequence was 
analyzed using the FASTDB and B LAS TP programs (Intelligenetics). The fasciclin IV ORF 

u unambiguously contains 10 of the 1 1 peptide sequences determined by microsequencing the 
lgj fasciclin IV trypsin and Lys-C peptides. 

O 

Generation of Polyclonal Antibodies From Bacterial Fusion Proteins 

Jj Bacterial trpE fusion proteins were constructed using pATH (Koerner et al., 1 99 1 ) 

J vectors, three restriction fragments encoding extracellular sequences, and one fragment (770 
20 bp Hindin/ Eco Rl, which includes amino acids 476-730 ) encoding both extracellular and 

intracellular sequences (designated #98-3). Fusion proteins were isolated by making an 
□ extract of purified inclusion bodies (Spindler et al., 1984), and rats were immunized with 
•j< ~70mg of protein emulsified in RIBI adjuvant (Immunochem Research). Rats were injected 

at two week intervals and serum was collected 7 days following each injection. Sera were 
25 tested histologically on grasshopper embryos at 45% of development. Construct #98-3 

showed a strong response and exhibited a staining pattern identical to that of MAb 6F8. Two 

of the extracellular constructs responded weakly but also showed the fasciclin D/ staining 

pattern. All pre-immune sera failed to stain grasshopper embryos. 

30 S2 Cell Transfections, Aggregation Assays, and Western Analysis 

A restriction fragment containing the full length fasciclin IV cDNA was cloned into 
pRmHa-3 (Bunch et al, 1988) and co-transformed into Drosophila S2 cells (Schneider, 1972) 
with the plasmid pPC4 (Jokerst et al., 1989), which confers a-amanitin resistance. S2 cells 
were transformed using the Lipofectin Reagent and recommended protocol (BRL) with minor 
35 modifications. All other S2 cell manipulations are essentially as described (Snow et 

al.,1989), including adhesion assays. Fasciclin IV expression in transformed cell lines was 
induced for adhesion assays and histology by adding CuS0 4 to 0.7 raM and incubating for at 
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least 48 hrs. Northern analysis confirmed transcription of fasciclin IV and surface-associated 
staining of the S2 cells with polyclonal serum #98-3 strongly suggests fasciclin IV is being 
transported to the cell surface. Preparation of membranes from S2 cells and from 
grasshopper embryos, PAGE, and Western blot were performed as previously described 
5 (Elkins et al., 1990b) except that signal was detected using the enhanced chemiluminescence 
immunodetection system kit (Amersham). Amount of protein per lane in each sample 
loaded: fasciclin IV protein, ~5 ng; S2 cell membranes, 40 mg; grasshopper membranes 80 
mg. Amounts of protein loaded were verified by Ponceau S staining of the blot prior to 
incubation with the antibody. 

10 
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10 

n. Isolation and characterization of Tribolium (SEP ID NOS:63 and 64) and Drosophila 
(SEP ID NOS:59 and 60) Semaphorin I. Drosophila Semaphorin II (SEP ID NOS:61 and 
62), Human Semaphorin m (SEP ID NGS:53 and 54) and Vaccinia Virus Semaphorin IV 
M (SEP ID NPS:55 and 56) and Variola Maior (smallpox) Virus Semaphorin V (SEP ID 

153 NPS:67and 68). 

O 
I 

%l We used our G-Semaphorin I cDNA in standard low stringency screening methods (of 

jjj both cDNA and genomic libraries) in an attempt to isolate a potential Semaphorin I 
m homologue from Drosophila. We were unsuccessful in these screens. Since the sequence 
2© was novel and shared no similarity to anything else in the data base, we then attempted to see 
if we could identify a Semaphorin I homologue in other, more closely related insects. If 
Q possible, we would then compare these sequences to find the most conserved regions, and 
;t? then to use probes (i.e., oligonucleotide primers for PCR) based on these conserved regions to 
Sj find a Drosophila homologue. 
25 In the process, we used the G-Semaphorin I cDNA in low stringency screens to clone 

Semaphorin I cDNAs from libraries made from locust Locusta migratoria embryonic RNA 
and from a cDNA embryonic library from the cricket Acheta domes tica. We used PCR to 
clone genomic fragments from genomic DNA in the beetle Tribolium, and from the moth 
Manduca. We then used the Tribolium genomic DNA fragment to isolate cDNA clones and 
30 ultimately sequenced the complete GRF for the Tribolium cDNA. 

In the meantime, we used the partial Tribolium and Manduca sequences in 
combination with the complete grasshopper sequence to identify conserved regions that 
allowed us to design primers for PCR in an attempt to clone a Drosophila Semaphorin I 
homologue. Several pairs of primers generated several different bands, which were 
35 subcloned and sequenced and several of the bands gave partial sequences of the Drosophila 
Semaphorin homologue. Pne of the bands gave a partial sequence of what was clearly a 
different, more divergent gene, which we call D-Semaphorin II. 



32 



B94-002-8 



Based on the sequence of PCR products, we knew we had identified two different 
Drosophila genes, one of which appeared to be the Semaphorin I homologue, and the other a 
second related gene. The complete ORF sequence of the D-Semaphorin I homologue 
revealed an overall structure identical to G-Semaphorin I: a signal sequence, an extracellular 
5 domain of around 550 amino acids containing 16 cysteines, a transmembrane domain of 25 
amino acids, and a cytoplasmic domain of 1 17 amino acids. When we had finished the 
sequence for D-Semaphorin II, we were able to begin to run homology searches in the data 
base, which revealed some of its structural features further described herein. The Semaphorin 
II sequence revealed a different structure: a signal sequence of 16 amino acids, a -525 amino 
10 acid domain containing 16 cysteines, with a single immunoglobulin (Ig) domain of 66 amino 
acids, followed by a short unique region of 73 amino acids. There is no evidence for either a 
transmembrane domain or a potential phospholipid linkage in the C-terminus of this protein. 
Thus, it appears that the D-Semaphorin II protein is secreted from the cells that produce it. 
, The grasshopper, Tribolium, and Drosophila Semaphorin I cDNA sequences, as well as the 
lgj sequence of the D-Semaphorin II cDNA, are shown herein. In addition, we used this same 
Jf technique to identify Semaphorin I genes in a moth, Manduca sexta, a locust, Locusta 
Cj migratoria, and a cricket, Acheta domestica. 

Q j : With this large family of insect Semaphorin genes, we identified a number of good 

~J stretches of the right amino acids (with the least degeneracy based on their codons) with 

2© strong homology for designing primers for PCR to look for human genes. We designed a set 
of oligonucleotide primers, and plated out several human cDNA libraries: a fetal brain library 
q (Stratagene), and an adult hippocampus library. We ultimately obtained a human cDNA PCR 
;J bands of the right size that did not autoprime and thus were good candidates to be bonafide 
-;j Semaphorin-like cDNAs from humans. These bands were purified, subcloned, and 

25 sequenced. 

Whole-mount in situ hybridization experiments showed that D-Semaphorin I and II 
are expressed by different subsets of neurons in the embryonic CNS. D-Semaphorin I is 
expressed by certain cells along the midline as well as by other neurons, whereas D- 
Semaphorin n is not expressed at the midline, but is expressed by a different subset of 

30 neurons. In addition, D-Semaphorin II is expressed by a subset of muscles prior to and during 
the period of innervation by specific motoneuron. On the polytene chromosomes, the D- 
Semaphorin I gene maps to (gene-band-chromosome) 29E1-22L and that of D-Semaphorin II 
to 53C9-102R. We have identified loss of function mutations in the D-Semaphorin I gene 
and a pair of P-element transposon insertions in the D-Semaphorin II gene which appear to 

35 cause severe phenotypes. 

When we lined up the G-Semaphorin I, T-Semaphorin I, D-Semaphorin I, and D- 
Semaphorin II sequences and ran the sequences through a sequence data base in search of 
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other sequences with significant similarity, we discovered a curious finding: these 
Semaphorins share sequence similarity with the A39R open reading frame (ORF) from 
Vaccinia virus and the A43R ORF from Variola Major (smallpox) virus and we discovered 
that the amino acids shared with the virus ORF were in the same regions where the insect 
5 proteins shared their greatest similarity. The viral ORF began with a putative signal 
sequence, continued for several hundred amino acids with sequence similarity to the 
Semaphorin genes, and then ended without any membrane linkage signal (suggesting that the 
protein as made by the infected cell would likely be secreted). 

We reasoned that the virus semaphorins were appropriated host proteins 

1 0 advantageously exploited by the viruses, which would have host counterparts that most likely 
function in the immune system to inhibit or decrease an immune response, just as in the 
nervous system they appear to function by inhibiting growth cone extension. Analogous to 
situations where viruses are thought to encode a secreted form of a host cellular receptor, here 
the virus may cause the infected cell to make a lot of the secreted ligand to mimic an 

15=2 inhibitory signal and thus help decrease the immune response. 

s 

v: HI. Isolation and characterization of Murine CNS Semaphorin HI Receptor using 
CP Epitope Tagged Human Semaphorin HI (hSHT) 

Jjj mRNA was isolated from murine fetal brain tissue and used to construct a cDNA 

2Q library in a mammalian exprssion vector, pCMX, essentially as in Davis et al. (1991) Science 

9 253, 59. 

ry 

q The transfection and screening procedure is modified from Lin et al (1992) Cell 68, 

Hp 775. COS cells grown on glass slide flaskettes are transfected with pools of the cDNA 
S clones, allowed to bind radioiodinated hSHI truncated at the C-terminus end of the 
25 semaphorin domain. In parallel, similarly treated COS cells are allowed to bind unlabelled 
human semaphorin HI truncated at the C-terminus end of the semaphorin domain and there 
joined to a 10-amino acid extension derived from the human c-myc proto-oncogene product. 
This modified hSHI allows the identification of hSHI receptors with the use of the tagged 
ligand as a bridge between the receptor and a murine monoclonal antibody which is specific 
30 for an epitope in the c-myc tag. Accordingly, after binding unlabelled hSHI the cells are 

exposed to the monoclonal which may be labeled directly or subsequently decorated with a 
secondary anti-mouse labeled antibody for enhanced signal amplification. 

Cells are then fixed and screened using dark-field microscopy essentially as in Lin et 
al. (supra). Positive clones are identified and sequence analysis of murine CNS Semphorin III 
35 receptor cDNA clones by the dideoxy chain termination method is used to construct full- 
length receptor coding sequences. 

It is evident from the above results that one can use the methods and compositions 
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disclosed herein for making and identifying diagnostic probes and therapeutic drugs. It will 
also be clear to one skilled in the art from a reading of this disclosure that advantage can be 
taken to effect alterations of semaphorin responsiveness in a host. 

All publications and patent applications cited in this specification are herein 
5 incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. Although the 
foregoing invention has been described in some detail by way of illustration and example for 
purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in 
the art in light of the teachings of this invention that certain changes and modifications may 
10 be made thereto without departing from the spirit or scope of the appended claims. 

Table 1 . Deduced amino-acid sequence of semaphorin gene family. 

Approximate position of enumerated peptide classes are indicated by parenthetical (a) 
through (o); semaphorin domains are bounded by arrows; G: grasshopper semaphorin I (SEQ 
ljh ID NO:58), T: Tribolium semaphorin I (SEQ ID NO:64), Dl : Drosophila semaphorin I (SEQ 
O ID NO:60), D2: Drosophila semaphorin H (SEQ ID NO:62), H3: Human semaphorin I (SEQ 
Zj, ID NO:54), V4: Vaccinia virus semaphorin IV (SEQ ID NO:56), V5: Variola virus (human 
ffl small pox) semaphorin IV (SEQ ID NO:66); small case residues: conserved residues; 
if underline: signal sequence; solid bar: transmembrane domain; double dashes: 
2Q immunoglobulin domain. 
□ 



25; 



G MR AALVAVAALLWWALHAAAWV NDVS PKMYVOFGEERVOR 

T M WKILVWSICLIALCHAWMP DSSSKLINHFKSVESKS 
Dl 

D2 MSLLQ LSPLLALLLLLCSSVS ETAADYENTWNFYYER PCC TGNDQGNNNYGKHGADHVRE 

H 3 MGWLTR I VCLFWGVLLTA RANYONGKNNVPRL KL S YKEMLE SNNVI T 

V4 MMVLLH AVYSIVFVDVIII KVORYINDI 

I 

G f -Lg--nESHKDHf KLLeKDHNSlLvga — rNI VYnlSLRDLTEFTEQRiEwHSsGAHRELcY 

T f T - g — nAT F PDH f I VLNQDET S I L vgG — rNR VYnL S I FDL S ERKGGRi DwP S s D AHGQLc I 

Dl 

D2 fNCgKLY YRTf HMNeDRDT- 1 YvgaMDrVF RVnLQNI S S SNCNRDAi NLE PTRDDWS c V 

H3 fN-gLAnSSSYHTf LLDeERSR-lYvgaKDHIFSFDLVnl KDFQKiVwPVsYTRRDEcK 

V4 LTLDIFYLFKKMIPLLFILFYFANGIEWHKFETSEEIISTYLLDDV LYTGVNGAVYTFSN 



DDCQN- YIR ( a ) ICGTN(b) 

G LkgkS-Eddcqn-yir — VlAKIDDDrVLIcgtnaYKpLcRHyALKd GDyWeKEYEgRg 

T LkgkT-Dddcqn-yir — HYSSEPGKlVIcgtnSYKpLcRTyAFKE GKyLVeKEVEglg 

Dl Eddcqn-yir — iMWPSPGrlFvcgtnSFRpMcNTyllSd SNyTLeATKNgQA 

45 D2 SkgkSQIFdcKnHViQ — SMDQGD — rlYvcgtnaHNpKDYViYANL THLPRSEYVIgVgLGIA 

H3 WAgkDILKEcAn-FiK — VlKAYNQTHlYAcgtGaFHpIcTYiEIGHPEDNIFKLENSHFENgRg 

V4 NkLNKTGLTNNn-yiTTSiKVEDADKDTLvcgtnNGNpKcWKiDGSd 

50 S F A 

CPYDP(c) TVADFSG (d) 
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G LcpFdpDh nstAIYSEgQ lysAtvadf sgTdpLiyrGpl 

T LcpyNpEh nstsVSYNgQ lFsAtvadf sgGdpLiyrEpQ 

Dl VcpydpRh nstsVIADNE lysgtvadf sgSdpIiyrEpl 

D2 KcpydpLD nstAIYVENGNPGGLPGlysgtNaEf TKAdTViFrTDlYNTSAKRLEYKF 

5 H3 KSpydpKL LTAsLLIDgE lysgtAadf MgRdFAiFrT-lGHHHPIRTEQHD 

V4 dpKhRGRGYAPYQnsKVTIISHNGcYLSDINIsKEGIKRWRRFDGPcGYDl 

V5 MIY1 

N 

10 LNAPNFV(e) ( f ) FFFRETA EYINCGK(g) (h)DKGG 

G -rteRSdLkQ-lnapnfv-NTMEyNdFIFf f fretaveyincgkaiysrvarvckHdkgg 

T -rteLSdLkQ-lnapnfv-NsVAygdYIFf f YretaveyMncgkViysrvarvckDdkgg 

Dl -QteQYdSLS-lnapnfv-SsFtQgdFvyf f fretaveFincgkaiysrvarvckWdkgg 

D2 KrtLKYdSkW-lDKpnf v-GsFDIgEYvyf f f retaveyincgkaVysriarvckKdVgg 

15 H3 -SRWLNdpkF-ISaHLISESdNPEDdkvyf f f reNalDGEHSgkaTHAriGQIckndFgg 

V4 YTADNVIpkDGlRGA- f vDKdGty-dkvylLf TDtlG-SKRIVklPy — iaQMcLndEgg 

V5 YTADNVIpkDGlQGA-fvDKdGty-dkvylLf TVtIG-SKRIVkIPy--iaQMcLndECg 

20 SSY ( i ) V 

PH WTTFLKAR NCSIPG(j) 

G phQF-GDrwtsf IkSrIncsVpgDypfyf neiqs tsdllegNyGGQVEkliygv 

T phQ-SRDrwtsflkarlncsipgEypfyf Deiqs tsdlvegRyNsDDskliygl 

Dl phRF-RNrwtsf lkSrlncsipgDypfyf neiqs AsNLvegQyGsMSskliygv 

2;£k D2 KNLl-AhNwAtYlkarlncsiSgEFpfyf neiqs VYQL PsDKsRF-FAT 

;=! H3 -hRSLVNKwttf IkarllcsVpgPNGIDTHf -DeLq dVFLMNFKDPKNPWygv 

V4 pSSlSShrwStf lkVElEcDiDgRSYRQIIHSRTiKTDNDtILYvF--FDsPYsk 

■W- V5 pSSlSShrwStLlkVElEcDiDgRSYSQINHSKTiKQIMIRYYMYSLIVLFQVRIMYLFY 

m Y 

rl GSAVC(k) NSNWLPV(l) PRPGTCVND (m) 

UJ G f ttpVnSiGgsavcaf smKSiLESf DgPf keqETMnsnwlAvPSLKvpeprpgQcvndsr 

ffj T LttpVnAiGgsalcayQmAdiLRVfEgSfkHqETInsnwlpvPQNLvpeprpgQcvRdsr 

35 Dl fNtpSnSiPgsavcafALQdiADTfEgQfkeqTGInsnwlpvNNAKvpDprpgScHndsr 

D2 f ttSTnGLIgsavcSf HINEiQAAf NgKf keqSSSnsAwlpvLNSRvpeprpgTcvndTS 

Wj H3 f ttSSnlFKgsavcMysmSdVRRVf LgPYAHRDGPnYQwVp-YQGRvpYprpgTc — PsK 

flj V4 saLcTysmNTiKQSf STSKLeg YTKQLpSpApglcLPAGK 

V5 EYH 

U G T lpdVSVnf V- kShT lmdEAvpaF f TRpi 1 1 r I s lQyr f t Ki AvdQqvRt PDgKAYdvL f 

fU T IlpdKKVnf i-kThSlmED-vpaLf GKpVlVrVslQyrf tAiTvdPqvKtlNNQYLdvLY 

°1 AlpdPTLnf i-kThSlmdENvpaFfSQpilVrTsTIyrf tQiAvdAqIKtPGgKTYdvI f 

45 D2 NlpdTVLnf i-RShPlmdKAvNHEHnNpVYYKRDlVFTK-LWDKIRIDILNQEYI-vYY 

H3 TFGGFDSTKDlpdDVITf A-rshPAmYNPvFPMNnRpiVIKTDVNyQf tQiVvd-RvDAEDgQY-dvMf 

V4 WpHTTFEViEKYNVlDdllKp-LSnQpiFEGPSGVKWFDIKEKENEHREYRIYFIKENS 

50 G igtddgkvIkALnSAsFDSSDTvDSvVIeeLQvLPPGVpVKnlYVvr Mdg — d 

T igtddgkvLkAvnIPKRHAKALLYRKYRTSVHPHGA — pVKQlKIAP G 

Dl VgtdHgkllkSvnAEsADSADKvTSvVIeelDvLTKSEpIRnlEIvrTMQYDQPKdgSYd 

D2 VgtNLgRIYklvnGEsLSKLLDIFEvAPNeAIQVMEISQTR 

H3 igtdVgTvLkVvSIPKETWY-DLEEvLLeeMTvFREPTAISA MELSTK 

55 V4 iYSFdTkSKQTRSSQVDARLFSvMVTSKPLFIADIGIGVGMPQMKKILKM* 

DPYCAWD (n) 

G dSklWvSdDEiLAiKlhrcGSdkltNcRecvSlqdpycawdNVELKcTAVgSpDwSAG 
60 T YGk\AA/vGKDEiRLANlNHcAS-k-tRcKDcvElqdpHcawdAKQNLcVSIDTVTSY-- 

Dl dGkllivTdSQWAiQlhrcHNdkltScSecvAlqdpycawdKIAGKcRSHgApRw-LE 

D2 -KSlYiGTdHRiKQiDlAMc-NRRYDNcFRcv — RdpycGwdKEANTcRPY 

H3 QQQIYiGSTAGVAQLPlhrcDIYG-KAcAecCLARdpycawd — GSAcS RYFPTAK 

65 i 

G KrRFIqNISLgEH-KAcGGRPQTEIVASPVPTQPTTKSSGDPVHSIHQAEFEpeiDNEiVI 
T -rFLIqdvVRgDD-NKcWsPQTDKkTVIKNKPSEVENEITNSIDEKDLDsSdpLiKTGLdD 



40: 
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D ENYFYqNvATgQH-AAcPsGKINSkDANAGEGKGFKNDMDLLDSRRQ — sKdQeilDNidK 
D2 ELDLLqdvANETS-DIcDsSVLKKk 
H3 R rTRRqdIRNgDPLTHc SDLHDNHH 



ill 



GVddSNVIPNTLAEINHAGSKLPSSQEKlPiytaetlTialvTSCLGAlVvgf IsgFLFS 

DSdcDPVSENSIGGcAV RQQIViytaGtlHiVwWsiVGlFSWLYsgLSVF 

NFEdD IINAQytVetlVMavLAGsiFSlLvgf FTgYFCG 



rrcRGEDYTDMpFpdQRHQLNRLTEAGlNADsPYLPPCANnkAAInlvLNv PpkN 

AKFHSd SQypEAPFIEQHNHLERlsANQTGYLTPRAnk-AVnlwKvSSSTPRpkK 

rrcHKdEDDNLpypdTEYEYFEQRQNVNsFPsSCRIQQEPKLLPQVEEvTYAEPVLLpQP 

KKTYI (o) 

AnGKNANs S AENKP IQkktyi* 

DnLDVS KDLNI ASDGTLQKI kk ty i * 

PPPNKMHsPKNTLRKPPMHQMHQGPNSETLFQFHVTATTPSSRIWATTSEHCVPTR* 



IWTyg QsVHIGc FVkl P EV1 KNEQ vTwYHHS KDKG 

GHSPEERIIygVENSsTFlEcSPkSQRAl vYwQFQRRNEE 



□ 

£1 D2 rYelRYSPTKYiETtERglVWsVNEAdGgRyDchLGGSLLcSYNITVDAHRcTPPNKSN 

H3 r ^ e E-IRVDDHiIRtDQglLLRsLQQKdSgNyLchAVEHGFIQTLLKVTLEVIDTEHLEE 

yl ====================================== 



D2 DYQKI YSDWcHEFEKYKTAMKS WEKKQGQc STRQNF ScNQHPNE I FRKPNV* 

H3 LLHKDDDGDGSKTKEMSNSMTPSQKVWYRDFMQLINHPNLNTMDEFcEQVWKRDRKQRRQ 

H3 RPGHTPGNSNKHLQENKKGRNRRTHEFERAPRSV* 
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